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Abstract 

We study exit times from a set for a family of multivariate autore- 
gressive processes with normally distributed noise. By using the large 
deviation principle, and other methods, we show that the asymptotic 
behavior of the exit time depends only on the set itself and on the 
covariance matrix of the stationary distribution of the process. The 
results are extended to exit times from intervals for the univariate 
autoregressive process of order n, where the exit time is of the same 
order of magnitude as the exponential of the inverse of the variance 
of the stationary distribution. 

1 Introduction 

We consider a multivariate autoregressive process with normally distributed 
noise, defined through 

X e t =AXU + eZu t>l, X £ =x , 

where Xf G M. d , A is a real d x d matrix , e is a small positive parameter 
and {£t}t>i is an i.i.d. sequence of multivariate standard normal random 
variables. We will study the time until the process exits from a set of the 
type {x G R d : \c T x\ < 1} for some vector c G M. d . Subject to some conditions, 
we show that the expectation of this exit time is of the order of magnitude 
exp(l/ (e 2 c T E oc c)) for small values of e, where e 2 Soo is the covariance matrix 
of the stationary distribution of the process. 
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The corresponding univariate case, where X t e 6 1 Vt > has been inves- 
tigated before, by Klebaner and Liptser in [2] and by Ruths in In [2J, 
the authors proved a large deviation principle (LDP) for a class of past- 
dependent models. As an example, they used the univariate autoregressive 
process {-X"f}t>o, where 

X\ = aXl_ x + e&, Xq = xq, 

where Xf G R Vt, |a| < 1, e is a positive parameter and {£*}t>i is an i.i.d. 
sequence of standard normal random variables. This process has a stationary 
distribution which is normal with mean and variance 1/(1 — a 2 ). Klebaner 
and Liptser showed that the family of processes {X^} t >o obeys an LDP with 
rate of speed e 2 and rate function 



Ku) 



\ J2tll( U t - OMt-l) 2 , M = X , 

oo, otherwise, 



where u = (uq, u±, . . .), and that this implies that 

limsup£ 2 log£r £ < J(l - a 2 ), (1) 

when t £ := min{t > 1 : \Xl\ > 1}. This upper bound is sharp, considering 
pE], where the corresponding lower bound was proven with another method. 
We also note the correspondence between this bound and the variance of the 
stationary distribution. 

In section 2 of this paper, we establish the corresponding large deviation 
principle for a family of multivariate processes. We also present a method 
to get a lower bound for the exit time of normally distributed processes. 
In section 3 we prove the asymptotics of the exit time of the multivariate 
autoregressive process. In section 4, we apply the same methods to get a 
result for the exit time from an interval for the univariate autoregressive 
process of order n, where 

XI = aiX^ + . . . + a n Xl_ n + e£ t , t > n, Xq = xq, . . . , X e n _ x = x n _i, 

where a%, . . . , a n are real parameters and {(,t}t>n is a sequence of univariate 
standard normal random variables. 
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2 Methods for upper and lower bounds 



In the first two parts of this section, we consider how to use the large deviation 
principle to get an upper bound of the asymptotics of an exit time from a set 
for a process. In the third part of the section, we consider another method for 
the corresponding lower bound, when the process has a normal distribution. 

2.1 The large deviation principle 

The following definition of the large deviation principle is taken from Varad- 
han ([3]), with the slight difference that we let the rate of speed be a function 
of e and call it q(e), as Klebaner and Liptser also did ([2])- The large de- 
viation principle (LDP) is then defined in the following way: Let {P £ } be a 
family of probability measures on the Borel subsets of a complete separable 
metric space Z. We say that {P E } satisfies the large deviation principle with 
a rate function /(•) if there exists a function I from Z into [0, oo] satisfying 
the following conditions: < I(z) < oo G Z, I is lower semicontinuous, 
the set {z : I(z) < 1} is a compact set in Z for all I < oo and 

lim sup q(e) log P £ {C) < — inf I(z) for every closed set C C Z and 

e^O 2 ec 

lim inf q(e) log P £ {G) > — inf I(z) for every open set G G Z. 

e— >0 z£G 

We will consider a family of processes {X^} t > Q , where X\ e M. d Vt > and 

XI = f(XU, Xt_ n) ei t ) for t > n, (2) 

where / : (IR^)"-" 1 " 1 h-> is a continuous function, {£t}t>n is an i-i-d.- sequence 
of random variables in R d , e is a positive parameter and the starting values 
are Xq = xq, . . . ,X^__ X = x n -\. We will prove a large deviation principle 
for the family of probability measures induced by {Xf }f> , assuming that a 
large deviation principle for the family of probability measures induced by 
{e£ n } holds. 

Theorem 2.1. Assume that the family of probability measures induced by 
where £ is a copy of £ nj satisfies a large deviation principle with rate 
function I e ^(z) and rate of speed q(e) . Then the large deviation principle holds 
for the family of probability measures induced by {X% }t>o with the same rate 
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of speed and the rate function 



I(y Q ,yi,y 2 ,...) = inf S^I^t). 

z t eR d Vt>n 

W=f(vt-l,—,Vt-n,zt),t>n ~ 
y =x ,...,y n - 1 =x n -i 

Proof: We have assumed that an LDP holds for the family of probability 
measures induced by the family {el;}, with the rate function I £ ^{z) and the 
rate of speed q(e). By [3], the LDP then holds for the family of probability 
measures induced by the family of vectors {e^ t }tL n with the same rate of 
speed and the rate function 

N 
t=n 

where N is finite. By the Dawson-Gartner theorem (see for example [I]), it 
follows that the LDP holds for the family of probability measures induced 
by {e^t}t>n with rate of speed q{e) and rate function 

oo 
t=n 

Now, since / is continuous, the mapping {e^t\t>n ^ {Xf}t>o is continuous 
in the space (R ') 00 with the metric p(x,y) = Ylj>i t+^~^]| > where || ■ || 
denotes the Euclidian norm on Thus, we can use the contraction principle 
(see for example [1]). It implies that the LDP for the family of probability 
measures associated with the family {X t e } t > holds with rate of speed q(e) 
and rate function 

I(V0,Vl,V2,---)= inf r , I{et t } t>n (Zn,Zn+l,---), 

(z n ,z n+1 ,...)£(R d )°° 

yt=f(Vt-l,—iVt-n,Zt),t>n 
y =xo,...,y n -i=x n -\ 

where the infimum over the empty set is taken as oo. We can write this rate 
function as 

oo 

I(Vo,yi,V2, •••) = inf y)^(zt) 

z t eK d Vt>n 

— t — Tl 

yt=f(yt-i,---,yt-n,z t ),t>n 

y =x ,...,y n _ 1 =x n _ 1 

and the proof is finished. 
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2.2 Exit times with the large deviation principle 

We will later use the large deviation principle to get a bound for the exit 
time from a set for a certain process. To see how this will be done, let us for 
the moment define the exit time as 

r := min{t > n : X £ t <£ Q}, (3) 

where Xf is defined as in equation [2] and Q is a set in M. d . Assume that the 
starting points xq, ■ ■ ■ , x n -i of the process belong to Q. For the expectation 
of the exit time, we have the following, where M is any integer greater than 
or equal to n: 

E X0 ,..., Xn _ 1 (r) < M + P(r>M-l)E XQ _ Xn _ 1 (r\r>M-l) 

< M + P(r> M -1)[MP(t = M\t> M -1) 

+ (M + sup E X0 _ Xn _ 1 (r))P(r > M\r > M — 1)] 

xo,...,x n -l£d 

< 2M+ sup E X0 _ Xn _ 1 (r)- sup P Xo ,..., Xn _ 1 (r > M). 

X o,..., Xn -i£Q xo,...,x n -i£Q 

Thus, it holds that 



sup ^ ,..., :rn _ 1 (r) < 



xo,-,x n -ieU mI a;o,...,a; n _ign "i ,™^ti-lV T — iV1 ) 

or simply that 

2M 

' " : (T) " inf, ,-./>, Ar<M) !1) 

for any set of starting points Xq, . . . , x n -\ G Q and any integer M > n. If the 
infimum in the denominator is attained for the starting points Xq, . . . , x* n _ x G 
Q, the inequality above implies that 

limsupg(£)logK E0 ,... )!E „_ 1 (T) < - lim q{e) log P x *,...,xi < M), (5) 

if the right hand side limit exists. Since 

Pxi,..^ n _Jj <M)= P x *,... K _ l (Xt i A for some t G {n, . . . , M}), 

the limit may be calculated if we have a large deviation principle for the 
family of probability measures induced by {Xf} t >o and if the function / and 
the set Q are suitable. 

In sections 3 and 4 we will use this method to get upper bounds for exit 
times for multivariate autoregressive processes and univariate processes of 
order n, respectively. 
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2.3 A lower bound for the exit time of normally dis- 
tributed variables 

We now leave the large deviation principle for a moment, and consider a 
method to get a lower bound for an exit time. The following theorem gives 
a lower bound for the asymptotics of the mean exit time from a symmetric 
interval for a sequence of univariate normally distributed random variables 
{Y t £ } t >i, with mean zero and bounded variance. Thus, in this section we 
consider the exit time 

r ( _i,i) :=min{t > 1 : \Y t e \ > 1}. 

Theorem 2.2. Assume that {Y/}t>i is a sequence of normally distributed 
random variables, all with mean 0, and that 

Var(Y/) < q(e)a 2 Vt > 1, 

for some a 2 > and some positive function q(e) where lim^o q(e) = 0. Then 

liminfg(e)log^r ( _i ) i ) > 

Proof: Since Y t £ has a normal distribution with mean zero and variance 
bounded by q{e)a 2 , E(e XY *) < exp(|A 2 g(5)<r 2 ) Vt > 1, which implies that 
also 

£(cosh(A>7)) < e^ x2q(£)a2 Vt > 1. 

For any N > 1, we have the following Chernoff-type bound of the probability 
that the exit time is smaller than or equal to iV: 

P(r ( _ in < N) = P( max \Y, £ \ > 1) = P(cosh(A max |K £ |) > cosh A) 

v 1 ' ' - ' \<t<m ~ i<t<[N] 1 * 17 - ' 

< (coshA)" 1 P(cosh(A max \Y £ \)), 

~ l<t<[N] 

which holds for any positive A. (Of course, the bound holds for any A G R.) 
Since 

[N] 

cosh(A max \Y, £ \) = max cosh(AY/) < y^cosh(AF/), 

l<t<[N] l<t<[N] V ~{ 



it follows that 



£(cosh(A max \Y £ \)) < [iVle^ 2 ^ 2 < Ne^ 2 ^ 2 . 

l<t<[N] 
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Thus, we have the bound 

P(t(-i,i) <N)< (cosh \y 1 Ne^ x2q{ - £S)a2 < 2e~ x Ne^ 2q{e)a \ 

for any A > 0. By choosing A in the optimal way, that is, as A = 1/ \q(e)a 2 ), 
we get the bound 

P(r ( _ M) <iV)<2i V exp(- i -i 5 - 5 ). 

Now, let 5 be a small positive number and choose N = exp( 2g ^ o . 2 — ^y)- 
Then P(t(_i ; i) > N) > 1 - 2exp(-^y), which implies that 

*n-.,, a ^(n-M> > AO > ex P( ^i- 5 - - j^-A),, 

and thus 

limTrif g(e) log J Br ( _ ljl) > — - 5. 
Since this holds for any 5 > 0, we get the lower bound 

liminf q(e) log Er^ ltl) > 
and the proof is finished. 

Remark: If we wanted to consider a one-sided exit time, for example the 
time until Y t £ > 1, we could simply use the exponential function instead of 
the hyperbolic cosine in the argument above. The resulting lower bound 
would be the same. 

3 Exit times for a multivariate autoregressive 
process 

In this section, we use the methods described in section 2 to show that the 
expectation of the exit time from a set {x e M. d : \c T x\ < 1} for a vector 
c G M. d (that is not the zero vector) for a multivariate autoregressive process 
is of the order of magnitude exp(l/(e 2 c T E 00 c)), where e 2 ^^ is the covariance 
matrix of the stationary distribution of the process. 
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3.1 A multivariate autoregressive process 

By a multivariate autoregressive process, we mean a process {Xf }t>o, such 
that 

Xf = AXf_ 1 +eZ t , X s =x , (6) 

where Xf e M. d Vt, A is a real d x d matrix, e is a positive parameter and 
{£t}t>i is an i-i-d. sequence of multivariate normal random variables in M. d , 
with mean zero and covariance matrix / (the unit matrix). For any t > 1, 
X\ has a multivariate normal distribution with mean EXf = A 1 xq, where xq 
is the starting point, and covariance matrix £ 2 E t , where 

E t = AE t -iA T + 7, t>2, (7) 

and Ei = /. The matrix E t can also be written as the sum 

E t = f^A i (A T ) i ,t>l. (8) 

Throughout, we will assume that all eigenvalues of A have absolute values 
less than one. The process {X t e } t > then has a stationary distribution, which 
is multivariate normal with mean (0,0, ...,0) T and covariance matrix e E^, 
where Eoo satisfies 

Eoo = A^A T + I. (9) 
Of course, the matrix E^ can also be expressed as the sum 

oo 

^ OQ = J2^(A T y. (io) 

i=0 

3.2 Exit times for the multivariate autoregressive pro- 
cess 

For the multivariate autoregressive process {X t e } t > , we will consider the exit 
time 

r := min{t > 1 : \c T X £ t \ > 1}, (11) 

where c is a vector in M. d , (0, . . . , 0) T . We will find the limit of e 2 log Et 
as e — > 0, by using the methods described in section 2. For the upper bound, 
we will use the large deviation principle, so we need the following corollary. 
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Corollary 3.1. The family of probability measures induced by {X^} t > , where 
XI is defined as in equation® satisfies the large deviation principle with rate 
of speed q(e) = e 2 and rate function 

I(yo, yi, • • •) = n ^2(Vt ~ Ay t -i) T (y t - Ay t -i), 



2 

t=i 



where y = x . 



Proof: By using Cramer's theorem (see for example [T]), one can show that 
the family of probability measures induced by the family where £ is 

multivariate normal with mean zero and covariance matrix /, satisfies the 
LDP with rate of speed e 2 and rate function 

J eC (z) = X -z T z, z e R d . 

By using Theorem I2.1[ we can deduce that the family of probability measures 
induced by {X^} t > Q satisfies the LDP with the same rate of speed e 2 and rate 
function 



I(yo,yi,---) = inf. yZ^zfzt 



*=1 

yt=Ay t -i+Z t 

yo=xo 



o ^2(yt ~ A yt-i) T (yt - ^-1), 



2 



where y = x . 



For the exit time of a multivariate autoregressive process, starting at the 
origin, we will prove the following theorem: 

Theorem 3.2. For the exit time r = min{t > 1 : |c T X t e | > 1}, where 
{X t £ } t > is the multivariate autoregressive process defined in equation® and 
x = (0,...,0) T , 

lim e 2 log Et 



s^o & 2c T E 00 c' 

where £ 2 £oo is the covariance matrix of the stationary distribution of the 
process. 
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Proof: The theorem follows from lemmas 13.31 and 13.41 below. 
Lemma 3.3. For {Xf} t > and r as in Theorem \3.S\ we have 

lim sup e 2 log E Xo r < — — — , 

for any xq such that |c T Xo| < 1. 

Proof: Consider the exit time r = min{t > 1 : |c T X t e | > 1}. This means that 
we consider exits from the set Q := {x G lR d : \c T x\ < 1}. For this set fl, 
xnf Xo e n P Xo (t < M) = -P(o,...,o) T ( r < M), since Xf has a normal distribution 
with mean A 1 xq. Thus, inequality [5] in section 2.2 implies that 

lim sup e 2 log E Xo r < — lim e 2 log Pr \t (t < M) , 

where the right hand side limit can be calculated with the LDP that was 
proven in corollary 13.11 Since 

{ T < M\ = { max \c T X £ A > 1}, 

we have 



lime 2 log P (0 ,...,of(t < M) = - inf \ f](y t -Ay t ^) 7 \y t -Ay-, 

maxKKju |c J yt\>l, I 

yo=(o,-,o) T 

Consider this infimum. The following holds: 



t-lj 



in U , ^^(z/t-^t-i) T (2/t-^t-i) 

CM |c J Jft|>l ^ t=1 



maxi<K M |c J y t | 

yo=(o,...,or 

/ 



inf 

1<N<M 



inf 

1<N<M 



inf \ YVy t - Ay t ^) T {y t - Ay t ^) 

\c t Vn\>1 2 ^ 



\»o=(0,...,0) 

/ 

\</o=(0,..'.,0) 



c T y N \>i 2 
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where the last equality holds because we can choose y t = Ay t _i for t > N. 
We can write y^ as the telescoping sum YltLi ^^{Vt ~~ Ay t _i), when y = 
(0, . . . , 0) T . By using the Cauchy-Schwarz inequality, we get 



N \ / N 



\t=l J \t=l 

f N \ 2 

> (^A^-Ay^)) =(cW- 



K t=i 

Equality in the Cauchy-Schwarz inequality is attained when y t — Ay t _i = 
K(A N ~ t ) T c, t — 1, . . . , N, for any constant ifel. This holds when 

V* = K (E ( AN ^ C = KUA N - l ) T c, for t = 1, . . . , N, 

where S t is defined as in section 3.1. By choosing K = 1/ {\c T T lN c\) ) we get 
\ cT Un\ = 1- Thus, we have now shown that 

j/o=(0,...,0) T 

Since S 4 = E£J^ < (^ T ) < and 

c T S t c = c T S^ic + JA'-^A'-Yc > c T S t _ic V* = 2, . . . , M, 
{c T S t c} t >i is a positive and increasing sequence. It follows, that 

inf 1 _ 1 

i<n<m 2c T T lN c 2c T S M c' 

and we have shown that 

lim sups 2 log E xo t < — ^ — . 

e^O 2c 1 h M C 

Since this inequality holds for any integer M > 1, we actually have 

lim sups 2 log E xo t < -j= — , (12) 
and the proof is finished. 
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Lemma 3.4. For {X^} t > and t as in Theorem \3.^\ we have 



liminfe logE (0) ... j0F (r) > 

where £7(o...,o) T denotes that the starting point of the process is Xq = (0, . . . , 0) T . 

Proof: For each t > 1, c T Xf has a univariate normal distribution with 
mean zero (since xo is now chosen to be the zero vector) and variance 
Var(c T Xf) = e 2 c T £ t c. In the proof of lemma I3UI we showed that {c T S t c}(>i 
is an increasing sequence, and thus we have 

Var(c T X 4 £ ) < e 2 c T E ooC Vt > 1. 

The statement of the lemma then follows immediately from theorem 12.21 

We illustrate the result in theorem 13.21 by simulating a bivariate process 
{X^} t > , where 

XZ = AXU+e£ t , 

where Xf = (X^Xyf G R 2 Vt > 1, X = (0, 0) T , {6}t>i is an i.i.d. se- 

0.8 1 



quence of bivariate standard normal random variables and A . . 

' (J. 5 

Since the eigenvalues of A (0.8 and 0.5) are less than one in absolute value, 
the process has a stationary distribution. Let c = (1, 1) T and consider the 
exit time r = min{t > 1 : \Xf x + Xf 2 \ > 1}. The matrix Eoo is calculated 
from equality [9l We get 

/ 925 10 

V -- I 81 9 
^oo — I 10 12 



Theorem O says that \im £ ^ e 2 log Er = l/(2c T E 00 c) = 81/2426 sa 0.03339. 
We use the statistical programming package R to simulate paths of the pro- 
cess for a few values of e. For each value of e, 100 paths are simulated and 
the mean exit time is calculated. The results are shown in table [TJ For 
e = 0.12, the mean exit time is around 84, while it is around 6 000 000 for 
e = 0.05. Naturally, the simulations become more and more time-consuming 
as e decreases and the mean exit time increases. 
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e 


0.12 


0.10 


0.08 


0.07 


0.06 


0.05 


e 2 log Et 


0.0639 


0.0554 


0.0473 


0.0434 


0.0415 


0.0389 



Table 1: Simulation of a bivariate autoregressive process 



4 Exit times for the autoregressive process of 
order n 

We will now use the methods in section 2 for the univariate autoregressive 
process of order n with normally distributed noise. 



4.1 The autoregressive process of order n 

The autoregressive process of order n is defined as the process {X £ }t>o, where 

X s = b 1 X £ _ 1 + . . . + b n Xl_ n + e£ t , Xq = x , . . . , X £ n _ x = x n -i. (13) 

Here X\ e R \/t > 0, b± : . . . , b n are real parameters, e is a positive parameter 
and {it}t>n is an i.i.d. sequence of standard normal (univariate) random 
variables. We consider the exit time from the interval (—1, 1), that is, 



r(_ 1)1 ) = min{t > n : \Xl\ > 1}. 



(14) 



The process can actually be seen as a multivariate process. Let Y t £ : = 
(X e , . . . 1 Xf_ n+l ) T Mt > n — 1. Then {Y t £ } t > n is a multivariate process that 
satisfies 



Y t e = BYU + s{tu 0, • . • , 0) T for t > n, Y*_ x = (x n . u . . . , x f \ 



where 



B 



( h b 2 
l o 



• b n \ 
■ 

o ••. 

••• 1 / 

This process is similar to but not exactly like the multivariate autoregressive 
process that we considered in section 3. For each t > n, Yf has a multivariate 



V 
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normal distribution with mean EY t £ = B l n+1 (x n -i, . . . , x Q ) T and covariance 
matrix £" 2 Ef, where E t is given by 

E t = BE t - 1 B T + (1,0,..., 0) T (1,0,...,0) fort > n + 1, 
E n = (1,0,...,0) T (1,0,...,0), 

or by the sum 

t—n 

£ t = ^£ fe (l,0,...,0) T (l,0,...,0)(5 T )*. 

fc=0 

Throughout this section, we make the assumption that bi,...,b n are such 
that all eigenvalues of the matrix B have absolute values less than one. Then 
the process {Y t £ } t > n ~i has a stationary distribution which is normal with the 
zero vector as mean and covariance matrix c~ 2 Xoo, where 

= 5E oo fi T + (l,0,...,0) T (l,0,...,0), or 

oo 

^oo = 5> fe (l,0,...,0) T (l,0,...,0)(5 T ) fc . 

This implies that the original univariate process {Xf} t > has a stationary 
distribution which is normal with mean zero and variance e 2 a 2 , where a 2 is 
given by 

oo 

- 2 = 5>n) 2 > (15) 

fc=0 

where B\ x denotes the element at the first row and the first column of the 
matrix B k . 

4.2 Exit times from an interval 

Under the assumption that the starting points Xq, . . . ,x n -i are zeroes, we 
have the following result for the exit time 

Theorem 4.1. For the autoregressive process {X t e } t > of order n, and the 
exit time T(-i,i), 

lime 2 log £ ( o,...,o)T(-i,i) = 

assuming that all eigenvalues of B are less than one in absolute value, and 
that Xq = . . . = x n _i = 0. 
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Proof: We use the large deviation principle to get an upper bound of the 
limit. The logarithmic moment generating function of (£, 0, . . . , 0) T , where £ 
is a standard normal random variable, is 

A(A) = log£(e Al «) = ^, 
where A = (Ai, . . . , A n ) T . Thus, the Fenchel-Legendre transform of A(A) is 

A*(z) = sup (X T z - A(A)) = ( 2"' if f 2 = • • • = Zn = , 
x e Rn [ oo, otherwise. 

where z = (zi, . . . , z n ) T G M. n . Cramer's theorem (see for example [TJ) 
now gives us that the family of probability measures induced by the fam- 
ily 0, . . . , 0) T } satisfies the large deviation principle with rate of speed 
e 2 and rate function / e (§,o,...,o) T ( ;Z ) = A*(z), z G M". By theorem 12 .![ the large 
deviation principle then holds for the family of probability measures induced 
by {Y t £ } t > n _i with rate of speed e 2 and rate function 

oo 

I(yn-l,Vn,---) = Wt n ^/ t ((,0,.J)r(2() 

y t =By t -i+zt, t>n t=n 
y„-i = (x n -i,...,xo) T 

\YZn((yt-Byt-i)i? if (y t -By t ^) k = 

Vk = 2,...,n, Vt>n 
and y n _i = (x n _i, . . . ,x ) T 
oo, otherwise, 

where y n -\,y n , . . . G M. n and (y t — By t -i)k denotes the fc:th element of the 
vector y t — By t -\. We now proceed as in the multivariate autoregressive 
case. This time, we consider exits from the set Q = {x G M. d : \c T x\ < 1} 
for the vector c = (1,0,..., 0) T . For this fl, inf yn _ lg Q P Un _ 1 (r(_i ) i) < M) = 
P(o,...,o) T (n-i,i) < M) and 

lime 2 log P( ,...,o) T (n-i,i) < M) = - inf I(y n _ u y nj . . .) 

max„<j< M |c J yt\>l 
SM_l=(0,...0r 

= ~ jf&J .T^ KVn-UVn, ■■■)), 

n<N<M |c T yjvl>l 

y„_i=(0,...,0)r 
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where 

N 



inf 

\c T y N \>l 



HVn-i, y n , ■■■)= inf - y2(y t - By t ^ x ) (y t - By t ^). 
\c T y N \>i 2 f-f 

Ifa-l=(O,... I 0)T y„_ 1 =(0,...,0)T 

(y t -By t -i) k =0, 2<k<nyt>n 

(16) 

As in the multivariate autoregressive case, one can use the Cauchy-Schwarz 
inequality to show that the sum on the right hand side in equality [TH] is 
larger than or equal to l/(2c T £jvc), where T, N is defined as in section 4.1. 
Equality is achieved for y t = KS((B A,_1 ) r c, t = n, . . . , N where the choice 
of K = l/(|c T SArc|) gives |c t t/tv| = 1- It is easy to check that this sequence 
{yt\t=n indeed satisfies (y t — By t -\)k — for k — 2, . . . , n and t = n, . . . N. 
Thus, we get 

1 N 1 
^ inf o y^iVt ~ B Vt-i) T (Vt ~ By t ~i) = — , 

\c T y N \>l 2c 1 T, N c 

j /n _ 1 =(o,...,or 

(yt-By t -i) k =0, 2<k<n,Vt>n 

which implies that 

. 2 . - , . _ 1 



lime IogP (0 ,.„ >0)a .(r ( _ 1|1) <M) = - 2drEwC . 

Inequality |5] now gives us that limsup £ ^ e 2 logi?( I . 0j ... jXn _ 1 )T(_ 1]1 ) < l/(2c T Sjvfc), 
and since this holds for any M > n, we may substitute T,^ for £ M . Since 
c = (1,0,..., 0) T , c T SooC = a 2 , and we have 



limsup^logE^,...^.,)^!,!) < — . (17) 

e->0 2(T Z 



Thus, we have the desired upper bound for any starting points x , . . . , x n _i G 
(— 1, 1). For the corresponding lower bound, we make the additional assump- 
tion that Xq = . . . = x n _i = 0. Then Xf has a normal distribution with mean 
zero and variance 



t—n 



<% = ]>>n) 2 < o\ 



k=0 



Theorem 12.21 then immediately gives us 

1 



limjnfe 2 log £ (0 ,...,o) > — . (18) 
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The upper and lower bounds in inequalities [T7] and [TH] together imply that 



lime 2 logE ( o,..., )r ( _i i i ) = — , (19) 
and the proof is finished. 
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